Measures of classification complexity based on neighborhood model
نویسندگان
چکیده
It is useful to measure classification complexity for understanding classification tasks, selecting feature subsets and learning algorithms. In this work, we review some current measures of classification complexity and propose two new coefficients: neighborhood dependency (ND) and neighborhood decision error (NDEM). ND reflects the ratio of boundary samples over the whole sample set; while NDEM is the decision error rate based on neighborhood local information of samples. We introduce neighborhood rough set model to define and compute decision boundary, furthermore compute NDEM. As one hopes to find the feature subspace where the classification task is with the least complexity, we construct a feature selection algorithm based on the proposed measures and sequentially forward selection. Experimental results show that NDEM correlates best with classification error rate from well-known classifiers compared to current measures and ND. Accordingly, NDEM can select the minimal feature subset with comparative classification performance.
منابع مشابه
Detection of Fake Accounts in Social Networks Based on One Class Classification
Detection of fake accounts on social networks is a challenging process. The previous methods in identification of fake accounts have not considered the strength of the users’ communications, hence reducing their efficiency. In this work, we are going to present a detection method based on the users’ similarities considering the network communications of the users. In the first step, similarity ...
متن کاملON FUZZY NEIGHBORHOOD BASED CLUSTERING ALGORITHM WITH LOW COMPLEXITY
The main purpose of this paper is to achieve improvement in thespeed of Fuzzy Joint Points (FJP) algorithm. Since FJP approach is a basisfor fuzzy neighborhood based clustering algorithms such as Noise-Robust FJP(NRFJP) and Fuzzy Neighborhood DBSCAN (FN-DBSCAN), improving FJPalgorithm would an important achievement in terms of these FJP-based meth-ods. Although FJP has many advantages such as r...
متن کامل3D Scene and Object Classification Based on Information Complexity of Depth Data
In this paper the problem of 3D scene and object classification from depth data is addressed. In contrast to high-dimensional feature-based representation, the depth data is described in a low dimensional space. In order to remedy the curse of dimensionality problem, the depth data is described by a sparse model over a learned dictionary. Exploiting the algorithmic information theory, a new def...
متن کاملCognitive Task Complexity and Iranian EFL Learners’ Written Linguistic Performance across Writing Proficiency Levels
Recently tasks, as the basic units of syllabi, and the cognitive complexity, as the criterion for sequencing them, have caught many second language researchers’ attention. This study sought to explore the effect of utilizing the cognitively simple and complex tasks on high- and low-proficient EFL Iranian writers’ linguistic performance, i.e., fluency, accuracy, lexical complexity, and structura...
متن کاملMultiresolution Estimates of Classification Complexity
In this paper, we study two measures of classification complexity based on feature space partitioning: “purity” and “neighborhood separability.” The new measures of complexity are compared with probabilistic distance measures and a number of other nonparametric estimates of classification complexity on a total of 10 databases from the University of Calfornia, Irvine, (UCI)
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007